Abstract 

In this paper we answer the following question: what is the lexicographically least 
sequence over the natural numbers that avoids 3/2-powers? 

1 Introduction 

Ever since the work of Thue more than a hundred years ago, mathematicians have been 
interested in avoiding patterns in words. Thue showed that it is possible to create an infinite 
sequence over a three-letter alphabet containing no nonempty squares (that is, no factors of 
the form xx), and over a two-letter alphabet containing no overlaps (that is, no factors of 
the form axaxa, where a is a single letter and x is a possibly empty word). 

Dejean [3] instituted the study of fractional powers in 1972. We say that a word x is a 
p/q-power, for integers p > q ≥ 1, if x can be written in the form y^e y' for some integer e ≥ 1, 
where y' is a prefix of y and |x|/|y| = p/q. For example, the German word schematische is 
a 3/2-power. 
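
The definition can be checked mechanically. The following is a minimal Python sketch, assuming the definition just given; the helper name `is_power` is ours and does not come from the paper. It uses the fact that a word has the form y^e y' with |y| = m exactly when it has period m.

```python
from fractions import Fraction


def is_power(word: str, p: int, q: int) -> bool:
    """Return True if word = y^e y' with y' a prefix of y and |word|/|y| = p/q."""
    n = len(word)
    # |y| must satisfy n / |y| = p/q, so |y| = n*q/p must be a positive integer.
    period = Fraction(n) / Fraction(p, q)
    if period.denominator != 1 or period < 1:
        return False
    m = int(period)
    # The word has the required form exactly when it has period m.
    return all(word[i] == word[i + m] for i in range(n - m))


print(is_power("schematische", 3, 2))  # y = "schemati", y' = "sche"
```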

Recently there has been some interest in avoiding patterns over an infinite alphabet, say 
N, the natural numbers. More precisely, we are interested in finding the lexicographically least 
infinite sequence over N avoiding p/q-powers, by which we mean a sequence not containing 
any factor that is an α-power with α ≥ p/q. As shown in [3], this corresponds to running a 
backtracking algorithm without actually doing any backtracking; whenever a choice for the 
next symbol fails, we increment the choice by 1 until a valid choice is found. 
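
The procedure just described can be sketched in Python for the case p/q = 3/2. This is an illustrative sketch under our own assumptions, not the authors' implementation: the function names are hypothetical, and the test relies on the observation that a factor with period p and exponent at least 3/2 contains a factor of length ⌈3p/2⌉ with the same period, so only suffixes of that length need to be examined after each new symbol.

```python
def ends_with_power(word: list[int]) -> bool:
    """True if some suffix of `word` is an alpha-power with alpha >= 3/2."""
    n = len(word)
    for p in range(1, n):              # candidate period |y|
        length = (3 * p + 1) // 2      # shortest length with length / p >= 3/2
        if length > n:
            break
        start = n - length
        # The suffix of this length has period p iff all these pairs agree.
        if all(word[i] == word[i + p] for i in range(start, n - p)):
            return True
    return False


def greedy_prefix(n_terms: int) -> list[int]:
    """Build the word symbol by symbol, never revisiting earlier choices."""
    word: list[int] = []
    for _ in range(n_terms):
        c = 0
        word.append(c)
        while ends_with_power(word):   # increment until the choice is valid
            c += 1
            word[-1] = c
    return word


print("".join(map(str, greedy_prefix(50))))
```

The loop always terminates, because a symbol larger than every earlier symbol cannot complete a repetition.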

Guay-Paquet and the second author [3] recently described the lexicographically least 
words avoiding squares and overlaps over N. The lexicographically least square-free word 
over N is 

01020103010201040102010301020105 ⋯ . 



Indexing from 0, the nth symbol in this sequence is the exponent of the highest power of 2 
dividing n + 1. This sequence is 2-regular in the sense of Allouche and Shallit [1, 2]. The 
lexicographically least overlap-free word can also be described compactly, but it is more 
complicated. 
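
The description of the square-free word is easy to reproduce. The short Python sketch below (the function name `ruler` is ours) computes the exponent of the highest power of 2 dividing n + 1 and prints the first 32 terms for comparison with the prefix displayed above.

```python
def ruler(n: int) -> int:
    """Exponent of the highest power of 2 dividing n + 1 (indexing from 0)."""
    m = n + 1
    e = 0
    while m % 2 == 0:
        m //= 2
        e += 1
    return e


prefix = "".join(str(ruler(n)) for n in range(32))
print(prefix)
```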

In this paper we are interested in w_{3/2}, the lexicographically least word over N avoiding 
3/2-powers. As we will see, this word has a short description, and it is 6-regular. 
Here are the first 100 terms of w_{3/2}: 

2 The word w_{3/2} 

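We define an infinite sequence v = a(0)a(1)a(2)⋯ as follows: 

\[
\begin{aligned}
a(10n) = a(10n+3) = a(10n+6) &= 0; \\
a(10n+1) = a(10n+5) = a(10n+8) &= 1; \\
a(10n+2) = a(10n+7) &= 2; \\
a(10n+4) &= \begin{cases} 3, & \text{if } n \equiv 0 \pmod 2; \\ 4, & \text{if } n \equiv 1 \pmod 2; \end{cases} \\
a(10n+9) &= \begin{cases} 3, & \text{if } n \equiv 0 \pmod 3; \\ 4, & \text{if } n \equiv 1 \pmod 3; \\ a((5n+2)/3) + 2, & \text{if } n \equiv 2 \pmod 3. \end{cases}
\end{aligned}
\tag{1}
\]
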
Thus we have 

v = 01203102130120410214012031021501204102130120310214 ⋯ . 

Note that v has both periodic aspects (in that some linearly-indexed subsequences are 
constant) and self-similar aspects (in that a(60n + 59) = a(10n + 9) + 2). 
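
As a sanity check, the sequence can be computed directly. The Python sketch below encodes our reading of the definition of a(n): the residues 0 through 8 modulo 10 follow the displayed prefix of v, and the case a(10n + 9) with n ≡ 2 (mod 3) is taken to be a((5n + 2)/3) + 2, which is an assumption chosen to agree with the identity a(60n + 59) = a(10n + 9) + 2.

```python
def a(n: int) -> int:
    """Term a(n) of v, under the case analysis described in the lead-in."""
    q, r = divmod(n, 10)
    if r in (0, 3, 6):
        return 0
    if r in (1, 5, 8):
        return 1
    if r in (2, 7):
        return 2
    if r == 4:
        return 3 if q % 2 == 0 else 4
    # r == 9
    if q % 3 == 0:
        return 3
    if q % 3 == 1:
        return 4
    return a((5 * q + 2) // 3) + 2   # q = 2 (mod 3), so 5q + 2 is divisible by 3


v_prefix = "".join(str(a(n)) for n in range(50))
print(v_prefix)
```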

Our goal is to show that v = w_{3/2}. To do so we will prove that (a) v is 3/2-power-free and 
(b) v is lexicographically least among all words avoiding 3/2-powers. 

3 v is 3/2-power-free 



If w is an α-power with α ≥ 3/2, then w contains a factor of the form xyx, where either |y| = |x| 
or |y| = |x| − 1. We show that neither of these factors appears in v. 

First, we observe that from the definition, v is "pseudoperiodic" with period 10. More 
precisely, 

v ∈ (0120{3,4}1021{3,4,...})^ω, 

where by x^ω we mean the infinite sequence xxx⋯. From this we immediately see that if 
a(n)a(n + 1) = a(n + i)a(n + i + 1) for some n ≥ 0 and i ≥ 1, then i ≡ 0 (mod 10). It now 
follows that if |y| = |x| − 1 ≥ 1 then, since |xy| is odd, xyx is not a factor of v. Similarly, if 
|x| = 1 then xx is not a factor of v. 

We now show that no word xyx, with |x| = |y| = k ≥ 1, occurs as a factor of v. For each 
factor a(n)a(n + 1)⋯a(n + 3k − 1) of length 3k we exhibit an index i, 0 ≤ i < k, such that 
a(n + i) ≠ a(n + 2k + i), implying that this factor is not a 3/2-power. The proof is divided up 
into several cases, depending on the residue class of k modulo 10. 

Case 1: k ≡ 1, 4, 6, 9 (mod 10). Then from (1) we have a(n) ≠ a(n + 2k) for all n, so we can 
take i = 0. 

Case 2: k ≡ 2, 7 (mod 10). If n ≡ 1, 6 (mod 10), then let i = 1 (which we can do since 
k > 1); otherwise let i = 0. One checks that a(n + i) ≠ a(n + 2k + i). 

Case 3: k ≡ 3, 8 (mod 10). If n ≡ 0, 5 (mod 10), let i = 1; otherwise let i = 0. 

Case 4: k ≡ 5 (mod 10). Then k ≥ 5. 

If k = 5 then we can choose i such that either n + i ≡ 4 (mod 10) or n + i ≡ 9 (mod 10). 
In the former case we have a(n + i) = 3, a(n + 2k + i) = 4 or vice versa. In the latter case 
we have a(n + i) ≠ a(n + 2k + i) by (1). 

Otherwise k ≥ 15. Then we can choose i such that n + i ≡ 4 (mod 10). We have 
2k ≡ 10 (mod 20), and hence a(n + i) ≠ a(n + 2k + i) from (1). 
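
Cases 1 through 4 can be spot-checked by computer. The Python sketch below is only an illustration: `choose_i` is a hypothetical helper that returns the index i prescribed in each case, and `a` encodes our reading of the definition of v (including the assumed form a((5n + 2)/3) + 2 of the last case). The check loops over small n and k and confirms a(n + i) ≠ a(n + 2k + i).

```python
def a(n: int) -> int:
    """Term a(n) of v (our reading of the definition)."""
    q, r = divmod(n, 10)
    if r in (0, 3, 6):
        return 0
    if r in (1, 5, 8):
        return 1
    if r in (2, 7):
        return 2
    if r == 4:
        return 3 if q % 2 == 0 else 4
    if q % 3 == 0:
        return 3
    if q % 3 == 1:
        return 4
    return a((5 * q + 2) // 3) + 2


def choose_i(n: int, k: int) -> int:
    """Index i from Cases 1-4; k must not be divisible by 10."""
    r = k % 10
    if r in (1, 4, 6, 9):                       # Case 1
        return 0
    if r in (2, 7):                             # Case 2
        return 1 if n % 10 in (1, 6) else 0
    if r in (3, 8):                             # Case 3
        return 1 if n % 10 in (0, 5) else 0
    if r == 5:                                  # Case 4
        if k == 5:
            # Among n, ..., n+4 some index is congruent to 4 or 9 (mod 10).
            return next(i for i in range(5) if (n + i) % 10 in (4, 9))
        return (4 - n) % 10                     # n + i = 4 (mod 10), i <= 9 < k
    raise ValueError("k = 0 (mod 10) belongs to Case 5")


ok = all(
    0 <= choose_i(n, k) < k
    and a(n + choose_i(n, k)) != a(n + 2 * k + choose_i(n, k))
    for n in range(200)
    for k in range(1, 60)
    if k % 10 != 0
)
print(ok)
```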

Case 5: k ≡ 0 (mod 10). Define b(n) = a(10n + 9) for n ≥ 0. Then from (1) we have, for 
n ≥ 0, that 

\[
\begin{aligned}
b(6n) = b(6n+3) &= 3; \\
b(6n+1) = b(6n+4) &= 4; \\
b(6n+2) &= \begin{cases} 5, & \text{if } n \equiv 0 \pmod 2; \\ 6, & \text{if } n \equiv 1 \pmod 2; \end{cases} \\
b(6n+5) &= b(n) + 2.
\end{aligned}
\tag{2}
\]

From this, an easy induction gives 

\[
b(n) = \begin{cases}
2t + 3, & \text{if the base-6 representation of } n \text{ ends with } 05^t \text{ or } 35^t; \\
2t + 4, & \text{if the base-6 representation of } n \text{ ends with } 15^t \text{ or } 45^t; \\
2t + 5, & \text{if the base-6 representation of } n \text{ ends with } 025^t,\ 225^t \text{ or } 425^t; \\
2t + 6, & \text{if the base-6 representation of } n \text{ ends with } 125^t,\ 325^t \text{ or } 525^t.
\end{cases}
\tag{3}
\]
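
The agreement between the recurrence (2) and the closed form (3) can be confirmed numerically. In the Python sketch below the names `b_rec` and `b_closed` are ours; `b_closed` strips the trailing 5s of the base-6 representation (counting them as t) and then inspects the last one or two remaining digits, treating missing leading digits as 0.

```python
def b_rec(n: int) -> int:
    """b(n) computed from recurrence (2)."""
    q, r = divmod(n, 6)
    if r in (0, 3):
        return 3
    if r in (1, 4):
        return 4
    if r == 2:
        return 5 if q % 2 == 0 else 6
    return b_rec(q) + 2                # r == 5


def b_closed(n: int) -> int:
    """b(n) computed from the base-6 description (3)."""
    t = 0
    while n % 6 == 5:                  # strip trailing 5s
        n //= 6
        t += 1
    last = n % 6
    if last in (0, 3):
        return 2 * t + 3
    if last in (1, 4):
        return 2 * t + 4
    before = (n // 6) % 6              # digit preceding the 2 (0 if absent)
    return 2 * t + 5 if before % 2 == 0 else 2 * t + 6


agree = all(b_rec(n) == b_closed(n) for n in range(6 ** 5))
print(agree)
```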



We need to find an index i, 0 ≤ i < k, such that a(n + i) ≠ a(n + 2k + i). We choose i 
such that n + i ≡ 9 (mod 10). Since k = 10r, there are r possible choices for i. It follows 
that we need to show, for each n ≥ 0 and r ≥ 1, that there exists j, 0 ≤ j < r, such that 

\[
b(n + j) \neq b(n + 2r + j).
\tag{4}
\]

If r ≡ 1, 2 (mod 3), then in fact we can choose j = 0 and use (2). 

Otherwise r ≡ 0 (mod 3). This is the most difficult case. To solve it, we first define two 
auxiliary sequences, as follows: 

\[
\begin{aligned}
c(0) &= 0; \\
c(6n+1) = c(6n+3) = c(6n+5) &= 2; \\
c(6n+2) = c(6n+4) &= 5; \\
c(6n) &= 6c(n) + 5;
\end{aligned}
\]

\[
\begin{aligned}
d(0) &= 0; \\
d(6n+1) = d(6n+5) &= 3; \\
d(6n+2) = d(6n+3) = d(6n+4) &= 6; \\
d(6n) &= 6d(n).
\end{aligned}
\]

Write r = 3s, and let the base-6 representation of s end in exactly t zeroes. An easy 
induction gives, for s ≥ 1, that 

\[
c(s) = \begin{cases}
6^{t+1} - 1, & \text{if the last nonzero digit of } s \text{ is even}; \\
3 \cdot 6^{t} - 1, & \text{if the last nonzero digit of } s \text{ is odd};
\end{cases}
\]

and 

\[
d(s) = \begin{cases}
6^{t+1}, & \text{if the last nonzero digit of } s \text{ is 2, 3, or 4}; \\
3 \cdot 6^{t}, & \text{if the last nonzero digit of } s \text{ is 1 or 5}.
\end{cases}
\]

We now claim that for all s ≥ 1 and j ≥ 0, we have 

\[
b(d(s)j + c(s)) \neq b(d(s)j + c(s) + 6s).
\tag{5}
\]

Since c(s) < d(s) ≤ 3s = r, (5) also provides a solution to (4). 
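
Claim (5) and the closed forms for c and d lend themselves to a numerical spot check. The Python sketch below is illustrative only. It assumes the recurrences for c and d with c(6n + 1) = c(6n + 3) = c(6n + 5) = 2, the value forced by the closed form 3·6^t − 1 at t = 0, and applies the rules c(6n) = 6c(n) + 5 and d(6n) = 6d(n) only for n ≥ 1. The helper `closed_forms` is a hypothetical name.

```python
def b(n: int) -> int:
    """b(n) from recurrence (2)."""
    q, r = divmod(n, 6)
    if r in (0, 3):
        return 3
    if r in (1, 4):
        return 4
    if r == 2:
        return 5 if q % 2 == 0 else 6
    return b(q) + 2


def c(s: int) -> int:
    if s == 0:
        return 0
    q, r = divmod(s, 6)
    if r == 0:
        return 6 * c(q) + 5
    return 2 if r % 2 == 1 else 5


def d(s: int) -> int:
    if s == 0:
        return 0
    q, r = divmod(s, 6)
    if r == 0:
        return 6 * d(q)
    return 3 if r in (1, 5) else 6


def closed_forms(s: int) -> tuple[int, int]:
    """Closed forms for (c(s), d(s)), valid for s >= 1."""
    t = 0
    while s % 6 == 0:                  # count trailing zeroes in base 6
        s //= 6
        t += 1
    digit = s % 6                      # last nonzero base-6 digit
    c_val = 6 ** (t + 1) - 1 if digit % 2 == 0 else 3 * 6 ** t - 1
    d_val = 6 ** (t + 1) if digit in (2, 3, 4) else 3 * 6 ** t
    return c_val, d_val


forms_ok = all((c(s), d(s)) == closed_forms(s) for s in range(1, 2000))
bounds_ok = all(c(s) < d(s) <= 3 * s for s in range(1, 2000))
claim_ok = all(
    b(d(s) * j + c(s)) != b(d(s) * j + c(s) + 6 * s)
    for s in range(1, 150)
    for j in range(150)
)
print(forms_ok, bounds_ok, claim_ok)
```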
To verify (5), we assume that s ends in t 0's. Then there are 30 cases to consider, based 
on the last nonzero digit s' of s and the last digit j' of j (digit in the base-6 representation, 
of course). The claim can now be verified by a rather tedious examination of the 30 cases. 
We provide the details for three typical cases: 

s' = 1, j' = 0: the base-6 expansion of d(s)j + c(s) ends with ℓ25^t for some t ≥ 0 and 
ℓ ∈ {0, 3}, while the base-6 expansion of d(s)j + c(s) + 6s ends with (ℓ + 1)25^t, so by (3) we 
see that b(d(s)j + c(s)) and b(d(s)j + c(s) + 6s) are of different parity. 

s' = 1, j' = 3: the base-6 expansion of d(s)j + c(s) ends with 15^t or 45^t for some t ≥ 0, 
while the base-6 expansion of d(s)j + c(s) + 6s ends with 25^t or 5^{t+1}. From (3) we get 
b(d(s)j + c(s)) = 2t + 4, while b(d(s)j + c(s) + 6s) ≥ 2t + 5. 

s' = 1, j' = 5: the base-6 expansion of d(s)j + c(s) ends with ℓ5^{t+1} for some ℓ ∈ {2, 5}, while 
the base-6 expansion of d(s)j + c(s) + 6s ends with ((ℓ + 1) mod 6)5^{t+1}. In either case, from 
(3) we have b(d(s)j + c(s)) ≥ 2t + 7, while b(d(s)j + c(s) + 6s) = 2t + 5. 

The proof that v is 3/2-power-free is now complete. 
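
The result of this section can be tested on a finite prefix by brute force. The Python sketch below uses the reduction stated at the start of the section: it searches for a factor xyx with |y| = |x| or |y| = |x| − 1. The function `has_xyx` is a hypothetical helper, and `a` encodes our reading of the definition of v, so this is a consistency check on a prefix rather than a proof.

```python
def a(n: int) -> int:
    """Term a(n) of v (our reading of the definition)."""
    q, r = divmod(n, 10)
    if r in (0, 3, 6):
        return 0
    if r in (1, 5, 8):
        return 1
    if r in (2, 7):
        return 2
    if r == 4:
        return 3 if q % 2 == 0 else 4
    if q % 3 == 0:
        return 3
    if q % 3 == 1:
        return 4
    return a((5 * q + 2) // 3) + 2


def has_xyx(word: list[int]) -> bool:
    """True if word contains xyx with x nonempty and |y| in {|x| - 1, |x|}."""
    n = len(word)
    for lx in range(1, (n + 1) // 2 + 1):
        for ly in (lx - 1, lx):
            shift = lx + ly            # distance between the two copies of x
            total = shift + lx         # length of xyx
            for start in range(n - total + 1):
                if word[start:start + lx] == word[start + shift:start + total]:
                    return True
    return False


prefix_is_free = not has_xyx([a(n) for n in range(300)])
print(prefix_is_free)
```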

4 v is the lexicographically least 3/2-power-free sequence 

In this section we complete our characterization of w_{3/2} by showing that a 3/2-power is formed 
whenever any symbol of v, other than 0, is decremented. 

The symbol 0 appears in positions congruent to 0, 3, and 6 (mod 10), so it suffices to 
examine other positions. 

If the symbol is in a position congruent to 1, 5, or 8 (mod 10), then it is a 1, and it follows 
0, 02, 03, or 04. Decrementing this to 0 then produces the square 00 or one of the 3/2-powers 
020, 030, or 040. 

If the symbol is in a position congruent to 2 or 7 (mod 10), then it is a 2, and it follows 
01 or 10. Decrementing to 0 produces 010 or 00, while decrementing to 1 produces 11 or 
101. 

If the symbol is in a position congruent to 4 (mod 10), then it is either 3 or 4, and it 
follows 0120. Decrementing to 0 produces 00; decrementing to 1 produces the 5/3-power 
01201; and decrementing to 2 produces 202. If the symbol is 4, there is also the possibility 
of decrementing to 3, and it occurs at a position ≥ 14. If the 4 is decremented to 3, then 
the immediately preceding 15 symbols (including the 3) form a 3/2-power. 

The last case is that the symbol is in a position congruent to 9 (mod 10). Then this 
position is 10n + 9 for some n, and the symbol is a(10n + 9) = b(n). It therefore follows 1021. 
If it is decremented to 0, this produces 10210. If it is decremented to 1, it produces 11. If it 
is decremented to 2, it produces 212. We now have to handle the possibility of decrementing 
to m, for some m with 3 ≤ m < b(n). 
If n ≡ 0 (mod 3), then b(n) = 3, so there are no other possibilities to consider. If 
n ≡ 1 (mod 3), then b(n) = 4, so we also have to consider the possibility of decrementing to 
3. In this case, the immediately preceding 15 symbols (including the 3) form a 3/2-power. 

It remains to consider the case when n ≡ 2 (mod 3). Here b(n) is at least 5. If we 
decrement b(n) to 3, then the immediately preceding 30 symbols form a 3/2-power, while if we 
decrement to 4, then the immediately preceding 15 symbols form a 3/2-power. 

For all other cases, we replace b(n) by m ≥ 5. We claim this gives the 3/2-power xyx in v, 
where x is of length ℓ_m, in the preceding 3ℓ_m symbols, where 

\[
\ell_m = \begin{cases}
30 \cdot 6^{m/2 - 3}, & \text{if } b(n) \text{ is odd and } m \text{ is even}; \\
60 \cdot 6^{(m+1)/2 - 3}, & \text{if } b(n) \text{ is odd and } m \text{ is odd}; \\
30 \cdot 6^{m/2 - 3}, & \text{if } b(n) \text{ is even and } m \text{ is even}; \\
60 \cdot 6^{(m+1)/2 - 3}, & \text{if } b(n) \text{ is even, } m \text{ is odd, and } m \neq b(n) - 1; \\
30 \cdot 6^{(m+1)/2 - 3}, & \text{if } b(n) \text{ is even and } m = b(n) - 1.
\end{cases}
\]



To see this, note that from (1) it is enough to show that 

\[
b(n + 1 - 3\ell_m/10) \cdots b(n - \ell_m/5 - 1) = b(n + 1 - \ell_m/10) \cdots b(n - 1)
\tag{6}
\]

and b(n − ℓ_m/5) = m. This can be done by a tedious examination of each case in (3). We 
give here one representative case, leaving the rest to the reader. 

Suppose the base-6 representation of n ends with 35^t for t ≥ 1 and so b(n) = 2t + 3. 

Case 1: m > 6 is even. We have ℓ_m/10 = 3·6^{m/2−3}. The base-6 expansion of n + 1 − ℓ_m/10 is 
35^{t−i−1}30^i, while the base-6 expansion of n − 1 is 35^{t−1}4, where 1 ≤ i ≤ t − 2 and m = 2i + 6. 

On the other hand, the base-6 expansion of n + 1 − 3ℓ_m/10 is 35^{t−i−2}430^i, while the 
base-6 expansion of n − ℓ_m/5 − 1 is 35^{t−i−2}45^i4. Using (3) we see that b takes the same 
values on these intervals. On the other hand, b takes the value m = 2i + 6 at 35^{t−i−2}45^i5. 
Thus changing b(n) to m forms a 3/2-power. 

Case 2: m = 6. Then ℓ_m/10 = 3. It is now easy to see that b(n − 5)b(n − 4) = b(n − 2)b(n − 1) = 
34, while b(n − 3) = 6. Thus changing b(n) to m forms a 3/2-power. 

Case 3: m is odd. We have ℓ_m/10 = 6·6^{(m+1)/2−3}. The base-6 expansion of n + 1 − ℓ_m/10 is 
35^{t−i−1}0^{i+1}, while the base-6 expansion of n − 1 is 35^{t−1}4, where 1 ≤ i ≤ t − 2 and m = 2i + 5. 

On the other hand, the base-6 expansion of n + 1 − 3ℓ_m/10 is 35^{t−i−2}30^{i+1}, while the 
base-6 expansion of n − ℓ_m/5 − 1 is 35^{t−i−2}35^i4. Using (3) we see that b takes the same values 
on these intervals. On the other hand, b takes the value m = 2i + 5 at 35^{t−i−2}35^{i+1}. Thus 
changing b(n) to m forms a 3/2-power. 
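
The claim about ℓ_m can be spot-checked numerically. The Python sketch below is an illustration under explicit assumptions. The function `ell` encodes our reading of the five-case formula, which collapses to three branches: 30·6^{m/2−3} for even m, 30·6^{(m+1)/2−3} when b(n) is even and m = b(n) − 1, and 60·6^{(m+1)/2−3} for the remaining odd m. The function `a` encodes our reading of the definition of v. For each n and each m with 5 ≤ m < b(n), the check replaces b(n) by m and compares the first and last thirds of the preceding 3ℓ_m symbols.

```python
def a(n: int) -> int:
    """Term a(n) of v (our reading of the definition)."""
    q, r = divmod(n, 10)
    if r in (0, 3, 6):
        return 0
    if r in (1, 5, 8):
        return 1
    if r in (2, 7):
        return 2
    if r == 4:
        return 3 if q % 2 == 0 else 4
    if q % 3 == 0:
        return 3
    if q % 3 == 1:
        return 4
    return a((5 * q + 2) // 3) + 2


def b(n: int) -> int:
    return a(10 * n + 9)


def ell(bn: int, m: int) -> int:
    """Assumed value of l_m when b(n) = bn is replaced by m (5 <= m < bn)."""
    if m % 2 == 0:
        return 30 * 6 ** (m // 2 - 3)
    if bn % 2 == 0 and m == bn - 1:
        return 30 * 6 ** ((m + 1) // 2 - 3)
    return 60 * 6 ** ((m + 1) // 2 - 3)


def decrement_creates_power(n: int, m: int) -> bool:
    """Replace b(n) by m; test whether the last 3*l_m symbols have the form xyx."""
    pos = 10 * n + 9
    length = ell(b(n), m)
    if 3 * length > pos + 1:           # not enough preceding symbols
        return False
    word = [a(i) for i in range(pos + 1 - 3 * length, pos + 1)]
    word[-1] = m
    return word[:length] == word[2 * length:]


all_ok = all(
    decrement_creates_power(n, m)
    for n in range(400)
    for m in range(5, b(n))
)
print(all_ok)
```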
5 Morphism description 

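The sequence w_{3/2} can also be generated as follows. Consider the morphisms φ and τ defined 
by 

\[
\begin{aligned}
\varphi(n) &= 3\,\bar{3}\,4\,\bar{4}\,3\,\overline{n+2} \\
\varphi(\bar{n}) &= 4\,\bar{3}\,3\,\bar{4}\,4\,\overline{n+2} \\
\tau(n) &= 0\,1\,2\,0\,n \\
\tau(\bar{n}) &= 1\,0\,2\,1\,n.
\end{aligned}
\]

Equation (2) is equivalent to the statement that the sequence 

\[
3\;\overline{b(0)}\;4\;\overline{b(1)}\;3\;\overline{b(2)}\;4\;\overline{b(3)}\;\cdots
\]

is a fixed point of φ; hence this sequence is φ^ω(3). It follows that w_{3/2} = τ(φ^ω(3)). 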
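
The morphism description can be exercised in code. The Python sketch below rests on an assumption about notation: letters come in two kinds, unmarked and marked (barred), and the two rules for each morphism apply to the two kinds respectively. A letter is modelled as a pair (value, barred). The names `phi`, `tau` and `generate` are ours. The output is compared against a(n) computed from our reading of the definition of v.

```python
def a(n: int) -> int:
    """Term a(n) of v (our reading of the definition)."""
    q, r = divmod(n, 10)
    if r in (0, 3, 6):
        return 0
    if r in (1, 5, 8):
        return 1
    if r in (2, 7):
        return 2
    if r == 4:
        return 3 if q % 2 == 0 else 4
    if q % 3 == 0:
        return 3
    if q % 3 == 1:
        return 4
    return a((5 * q + 2) // 3) + 2


Letter = tuple[int, bool]              # (value, barred)


def phi(letter: Letter) -> list[Letter]:
    n, barred = letter
    if not barred:
        return [(3, False), (3, True), (4, False), (4, True), (3, False), (n + 2, True)]
    return [(4, False), (3, True), (3, False), (4, True), (4, False), (n + 2, True)]


def tau(letter: Letter) -> list[int]:
    n, barred = letter
    return [1, 0, 2, 1, n] if barred else [0, 1, 2, 0, n]


def generate(n_terms: int) -> list[int]:
    """First n_terms symbols of tau applied to the fixed point of phi starting at 3."""
    word: list[Letter] = [(3, False)]
    while 5 * len(word) < n_terms:     # each letter contributes 5 output symbols
        word = [x for letter in word for x in phi(letter)]
    out = [s for letter in word for s in tau(letter)]
    return out[:n_terms]


print("".join(map(str, generate(50))))
```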

6 Avoiding only 3/2-powers 

In this section we consider a variant of w_{3/2}. Whereas w_{3/2} is the lexicographically least 
word over N not containing any α-power for α ≥ 3/2, we let x_{3/2} be the lexicographically 
least word over N not containing any (exact) 3/2-power. Here are the first 144 terms of x_{3/2}: 

Unlike w_{3/2}, the word x_{3/2} contains squares, for example. However, the underlying 
structures of w_{3/2} and x_{3/2} are the same. Namely, let y = f(0)f(1)f(2)⋯, where 

\[
\begin{aligned}
f(12n) = f(12n+1) = f(12n+4) = f(12n+7) = f(12n+8) &= 0, \quad n \ge 0; \\
f(12n+2) = f(12n+3) = f(12n+6) = f(12n+9) = f(12n+10) &= 1, \quad n \ge 0; \\
f(12n+5) &= \begin{cases} 2, & \text{if } n \equiv 0 \pmod 2; \\ 3, & \text{if } n \equiv 1 \pmod 2; \end{cases} \\
f(12n+11) &= \begin{cases} 2, & \text{if } n \equiv 0 \pmod 3; \\ 3, & \text{if } n \equiv 1 \pmod 3; \\ f(2n+1) + 2, & \text{if } n \equiv 2 \pmod 3. \end{cases}
\end{aligned}
\]

Note that f(12n + 11) + 1 = b(n) = a(10n + 9) for n ≥ 0. We show that y = x_{3/2}. In 
particular, it follows that x_{3/2} is 6-regular. 

First we show that y is 3/2-power-free. As in Section 3, for each n ≥ 0 and k ≥ 1 we 
exhibit an index i, 0 ≤ i < k, such that f(n + i) ≠ f(n + 2k + i). 

Case 1: k ≡ 1, 5, 7, 11 (mod 12). We have f(n) ≠ f(n + 2k) for all n, so let i = 0. 

Case 2: k ≡ 2, 4, 8, 10 (mod 12). If n is even, let i = 1; if n is odd, let i = 0. 

Case 3: k ≡ 3, 9 (mod 12). If n is even, let i = 0; if n is odd, let i = 1. 

Case 4: k ≡ 6 (mod 12). If k = 6 then we can choose i such that either n + i ≡ 5 (mod 12) 
or n + i ≡ 11 (mod 12); in either case we have f(n + i) ≠ f(n + 2k + i). Otherwise k ≥ 18, 
and we can choose i such that n + i ≡ 5 (mod 12), and hence f(n + i) ≠ f(n + 2k + i). 

Case 5: k ≡ 0 (mod 12). Since f(12n + 11) + 1 = b(n), this case follows immediately from 
Case 5 of Section 3. 

Therefore y is 3/2-power-free. Now we show that decrementing any nonzero symbol of y 
introduces a 3/2-power. 

The symbol in each position congruent to 0, 1, 4, 7, or 8 modulo 12 is 0, which cannot 
be decremented. 

The symbol in each position congruent to 2, 3, 6, 9, or 10 modulo 12 is 1, and it follows 
00, 01, 02, or 03. In each case, decrementing the 1 to 0 produces a 3/2-power. 

The symbol in each position congruent to 5 modulo 12 is either 2 or 3, and it follows 00110. 
Decrementing to 0 produces 001100, and decrementing to 1 produces 101. If the symbol is 
3, then decrementing to 2 produces a 3/2-power in the preceding 18 symbols (including the 2). 

The symbol in each position congruent to 11 modulo 12 is at least 2. Decrementing to 0 
produces 100110. Decrementing to 1 produces 111. If n ≡ 0 (mod 3), then f(12n + 11) = 2, 
so there are no other possibilities to consider. If n ≡ 1 (mod 3), then f(12n + 11) = 3, and 
decrementing to 2 produces a 3/2-power in the preceding 18 symbols. 

If n ≡ 2 (mod 3), then f(12n + 11) ≥ 4. Decrementing to 2 or 3 produces a 3/2-power in 
the preceding 36 or 18 symbols, respectively. It follows from (6) that if f(12n + 11) > m ≥ 4 
then decrementing f(12n + 11) = b(n) − 1 to m produces a 3/2-power in the preceding (18/5)ℓ_{m+1} 
symbols. 

We have shown that y is the lexicographically least 3/2-power-free word over N; hence 
x_{3/2} = y. The word x_{3/2} is generated by the morphism φ of the previous section as x_{3/2} = 
ν(φ^ω(3)), where ν is the morphism defined by 

\[
\begin{aligned}
\nu(n) &= 0\,0\,1\,1\,0\,(n-1) \\
\nu(\bar{n}) &= 1\,0\,0\,1\,1\,(n-1).
\end{aligned}
\]
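
The identity x_{3/2} = y can be checked on a prefix. The Python sketch below is illustrative: `f` encodes our reading of the definition of y (zeros at residues 0, 1, 4, 7, 8 modulo 12, ones at residues 2, 3, 6, 9, 10, and the two case distinctions at residues 5 and 11), and `greedy_exact` is a hypothetical helper that builds the lexicographically least word avoiding exact 3/2-powers, that is, factors xyx with |x| = |y|.

```python
def f(n: int) -> int:
    """Term f(n) of y (our reading of the definition)."""
    q, r = divmod(n, 12)
    if r in (0, 1, 4, 7, 8):
        return 0
    if r in (2, 3, 6, 9, 10):
        return 1
    if r == 5:
        return 2 if q % 2 == 0 else 3
    # r == 11
    if q % 3 == 0:
        return 2
    if q % 3 == 1:
        return 3
    return f(2 * q + 1) + 2


def ends_with_exact_power(word: list[int]) -> bool:
    """True if some suffix of word has the form xyx with |x| = |y| >= 1."""
    n = len(word)
    return any(word[n - 3 * k:n - 2 * k] == word[n - k:] for k in range(1, n // 3 + 1))


def greedy_exact(n_terms: int) -> list[int]:
    """Lexicographically least prefix avoiding exact 3/2-powers."""
    word: list[int] = []
    for _ in range(n_terms):
        c = 0
        word.append(c)
        while ends_with_exact_power(word):
            c += 1
            word[-1] = c
    return word


print("".join(map(str, greedy_exact(24))))
```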

As already mentioned, x_{3/2} contains squares. However, the only squares in x_{3/2} are 00 and 
11, as we now show. Suppose xx is a factor of x_{3/2} for some nonempty word x. The length of 
x cannot be even, because otherwise xx contains a 3/2-power. One checks from the definition 
of f(n) that the length of x cannot be 3. Similarly, f(n)⋯f(n + 4) ≠ f(n + k)⋯f(n + 4 + k) 
for all n ≥ 0 and odd k, which precludes a factor xx where the length of x is odd and at 
least 5. 

Hence the only square factors of x_{3/2} are 00 and 11. Since 000 and 111 are not factors of 
x_{3/2}, it follows that x_{3/2} is overlap-free. 
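
The statement about squares can be confirmed on a finite prefix. The Python sketch below collects every square factor xx in a prefix of y; the function `square_roots` is a hypothetical helper, and `f` again encodes our reading of the definition of y.

```python
def f(n: int) -> int:
    """Term f(n) of y (our reading of the definition)."""
    q, r = divmod(n, 12)
    if r in (0, 1, 4, 7, 8):
        return 0
    if r in (2, 3, 6, 9, 10):
        return 1
    if r == 5:
        return 2 if q % 2 == 0 else 3
    if q % 3 == 0:
        return 2
    if q % 3 == 1:
        return 3
    return f(2 * q + 1) + 2


def square_roots(word: list[int]) -> set[tuple[int, ...]]:
    """Set of all nonempty x such that xx is a factor of word."""
    n = len(word)
    found: set[tuple[int, ...]] = set()
    for k in range(1, n // 2 + 1):
        for i in range(n - 2 * k + 1):
            if word[i:i + k] == word[i + k:i + 2 * k]:
                found.add(tuple(word[i:i + k]))
    return found


prefix = [f(n) for n in range(360)]
roots = square_roots(prefix)
print(roots)
```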

7 Remarks 

A previous paper discussed the structure of the lexicographically least word over N avoiding 
n'th powers or overlaps [3]. In this paper, we discussed the structure of such a word avoiding 
3/2-powers. It remains to gain a deeper understanding of the lexicographically least word 
avoiding α-powers for arbitrary α. Computer experiments strongly suggest that a similar 
structure exists for 4/3-powers. But it is still not known whether the lexicographically least 
word avoiding 5/2-powers uses only finitely many distinct letters. 